Auto-focus in low-profile folded optics multi-camera system

ABSTRACT

An image capturing system and a method of autofocusing are disclosed such that, for example, when a folded optics configuration is used, a field corrector lens can be placed on the image sensor of the system and a plurality of lenses can be placed perpendicular to the image sensor. The plurality of lenses can be movable relative to the image sensor such that acceptable MTF curve performances can be obtained when the image capturing system is focused at reference distances.

CROSS REFERENCE TO RELATED APPLICATIONS

The present application claims the benefit under 35 U.S.C. §119(e) of U.S. Provisional Patent Application No. 61/975,680, filed on Apr. 4, 2014, entitled “METHOD OF AUTO-FOCUSING MULTI-CAMERA SYSTEM USING FOLDED OPTICS,” and of U.S. Provisional Patent Application No. 62/015,364, filed on Jun. 20, 2014, entitled “METHOD OF AUTO-FOCUSING MULTI-CAMERA SYSTEM USING FOLDED OPTICS,” the contents of each of these applications are hereby incorporated by reference herein.

TECHNICAL FIELD

The present disclosure relates to imaging systems, and particularly to autofocusing a multi-sensor imaging system having folded optics.

BACKGROUND

Many mobile devices, such as mobile phones and tablet computing devices, include cameras that may be operated by a user to capture still and/or video images. Because such mobile devices are typically designed to be relatively thin, it can be important to design the cameras or imaging systems to be as thin as possible in order to maintain a low-profile mobile device. One of the limiting factors as to how thin a mobile camera, imaging system or device can be constructed is the camera, as traditional mobile device cameras have a number of optical elements (e.g., lens system, autofocus assembly, and image sensor) arranged linearly along the height of the device. Accordingly, the optical stack height including optical elements (e.g. refractive optical components, support structures such as the lens, barrel or optical element spacers), the focal length of the lens system, autofocus mechanisms, and possibly other camera elements limit how thin a mobile device can be constructed. As the device becomes thinner the focal length of the camera may need to be shortened, which may make the image circle diameter decrease. If it is desired to increase the number of image sensor pixels then normally the pixel pitch will need to be made smaller or the camera field of view (FoV) of the scene in the object space will need to be increased. If it is not possible to reduce the pixel pitch then the FoV of the camera may need to be increased. At some point it may not be practical or possible to continue decreasing the focal length by increasing the FoV or decreasing the pixel pitch. Accordingly, it may be desired to have lower profile image capture devices without having to shorten the focal length or decrease the resolution of the image.

SUMMARY

Folded optic image sensor arrays allow for the creation of low-profile image capture devices without having to shorten the focal length. Some folded optic image sensor arrays employ a central mirror or prism with multiple facets to split incoming light comprising the target image of the scene into multiple portions for capture by the sensors in the array, wherein each facet directs a portion of the light from the target image toward a sensor in the array. Each portion of the split light may be passed through a lens assembly and reflected off of a surface positioned directly above or below a sensor, such that each sensor captures a portion of the image. The sensor fields of view can overlap to assist in stitching together the captured portions into a complete image.

Due to the reflection of light off of multiple surfaces toward multiple sensors and the height limitations on the camera, traditional autofocus modules and techniques are not adapted for such folded optic low-profile sensor arrays. The folded optics and other structural features of such sensor arrays can make autofocus mechanisms difficult to implement. Moving an autofocus lens assembly up and down over each sensor, as typically done today for most mobile devices with cameras, would increase the height of the system and may change the incident angle and/or the relative positioning of the optical axes with respect to an orthogonal line of the imaging plane.

As stated above, another problem with autofocus in folded optic array cameras is a small form factor (typically 4.5 mm or less), where ultra-high resolution across the image height is needed. Satisfying both height constraints and performance requirements is difficult to achieve with wide Field of View (FoV) lenses. The most straightforward way to focus the lens is to lift the entire lens assembly up and down over the sensor, but this may change the position of the optical axis of one camera with respect to the optical axis of each of the other cameras as well as increase the overall height of the system. An alternative approach is needed and is described below.

The aforementioned problems, among others, are addressed by the folded optic array camera autofocus techniques described herein for providing an autofocused image to each sensor. By redirecting light toward each sensor in the array using a primary and secondary surface, and by positioning the lens assemblies used to focus the incoming light between the primary and secondary surfaces, the sensor array may be positioned on a flat substrate parallel to a movable portion of the lens assembly. The longer focal length of such an array camera makes it possible to implement features such as optical zoom and to incorporate more complicated optics that require more space than commonly afforded by the traditional mobile camera, such as adding more optical elements. For example, the use of multiple lenses may increase the focal length of the camera and thus increase the camera's FoV as done for optical zoom lenses when more resolution is desired and likewise when the FoV is desired to be wider the focal length can be decreased. Further, the use of multiple lenses across the field of view of the system can increase the total effective resolution across the entire field of view of the multi-camera array (also referred to as the “synthetic aperture”).

In some embodiments, a lens system design enables lateral motion of a movable portion of a lens assembly within the mechanical tolerances of the folded optic system while maintaining good image performance, for example defined by having acceptable modulation transfer function (MTF) values and a focus range between 20 cm and infinity. The movable portion can be moved in a direction parallel to a plane formed by the image sensor. The lens system can additionally include a stationary portion of the lens assembly. In some embodiments, two or more movable lens assembles can be incorporated to implement Zoom and AF. In some implementations, the stationary portion of the lens assembly can be a field corrector lens placed in close proximity to the image sensor, for example affixed to a glass cover plate positioned over the sensor.

An autofocus assembly using the two-part lens system design described above can implement a guide rail and an actuator in some embodiments. For example, the movable portion of the lens assembly can be coupled to an actuator that moves the movable portion through a range of positions to achieve different focal lengths. In some embodiments, the movable portion can be coupled to the actuator by a guide rail passing along an edge of a secondary sensor prism, the secondary sensor prism positioned below the sensor. By moving the guide rail along the edge of the secondary sensor prism, the autofocus assembly can laterally (e.g., in a direction parallel to the plane formed by the image sensor) move the movable portion of the lens assembly while restriction tilt, roll, pitch, and yaw within the tolerances of the lens design.

In some embodiments, an autofocus assembly using a two-part lens system design as described above can be provided for each sensor in a folded optic array.

One aspect relates to an image capture system comprising a plurality of cameras configured to capture a corresponding plurality of portions of a target image scene, a camera of the plurality of cameras comprising an image sensor; a primary light folding surface configured to redirect light representing one portion of the corresponding plurality of portions of the target image scene in a first direction toward the image sensor; an optical element having an input surface configured to receive the light from the primary light folding surface, a secondary light folding surface configured to redirect the light in a second direction toward the image sensor, and an output surface through which light redirected by the secondary light folding surface propagates in the second direction toward the image sensor; a lens assembly comprising a stationary portion having a first surface coupled to the output surface of the optical element and a second surface coupled to the image sensor, and a movable portion positioned between the primary light folding surface and the optical element; an actuator configured to move the movable portion of the lens assembly along the first direction; and at least one guide rail coupled between the actuator and the movable portion of the lens assembly, the at least one guide rail positioned to slidably engage another surface within the camera to constrain motion of the movable portion of the lens assembly away from an optical axis or rotating around the optical axis, the optical axis substantially parallel to the first direction; and a processor configured to generate a final image of the target image scene based at least partly on the corresponding plurality of portions.

Another aspect relates to a method of manufacturing an image capturing system, the method comprising, for each of a plurality of cameras in an array disposing an image sensor on a substrate; affixing a cover glass to a light receiving surface of the image sensor; placing the image sensor in a slot of a substrate; placing a primary light folding surface in an aperture in the substrate, the primary light folding surface positioned to redirect light representing one portion of a plurality of portions of a target image scene in a first direction toward the image sensor; affixing a stationary lens to the cover glass; affixing an optical element comprising a secondary light folding surface to the stationary lens, the secondary light folding surface positioned to redirect the light in a second direction toward the image sensor; providing a movable lens assembly in a space formed between the optical element and the primary light folding surface; and coupling actuating means to the movable lens assembly to move the movable lens assembly along an optical axis substantially parallel to the first direction such that the actuating means constrains movement of the movable lens assembly away from the optical axis or rotating around the optical axis.

Another aspect relates to an autofocus device for a folded optic imaging system having an image sensor and a primary light folding surface, the autofocus device comprising a lens assembly comprising a stationary portion coupled to the image sensor, and a movable portion positioned between the image sensor and the primary light folding surface; a motor configured to move the movable portion of the lens assembly along an optical axis in a direction substantially parallel to a plane formed by the image sensor; and at least one guide rail coupled between the motor and the movable portion of the lens assembly, the at least one guide rail positioned to slidably engage another surface within the folded optic imaging system to constrain motion of the movable portion of the lens assembly away from the optical axis or rotating around the optical axis.

Another aspect relates to an image capture apparatus comprising means for capturing a plurality of portions of a target image scene; means for providing a stationary portion of a lens assembly; means for providing a movable portion of a lens assembly; means for moving the movable portion of the lens assembly; and means for controlling motion of the movable portion of the lens assembly by slidably engaging at least one adjacent surface.

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosed aspects will hereinafter be described in conjunction with the appended drawings and appendices, provided to illustrate and not to limit the disclosed aspects, wherein like designations denote like elements.

FIG. 1A illustrates a cross-sectional side view of an embodiment of a folded optic sensor assembly showing one sensor assembly and an associated autofocus device.

FIG. 1B illustrates a cross-sectional side view of an embodiment of a folded optic sensor assembly showing two sensor assemblies and associated autofocus devices.

FIG. 2 illustrates a block diagram of one embodiment of an image capture device.

FIG. 3 illustrates an embodiment of a folded optic image capture process with autofocus.

FIG. 4 illustrates a perspective view of an embodiment of a folded optic sensor assembly.

FIG. 5 illustrates a perspective view of an imaging system using multiple sensor assemblies.

FIG. 6 illustrates an embodiment of the projected fields of view of the folded optic sensor array embodiment.

FIG. 7A illustrates an embodiment of a folded optic sensor array having a lens assembly positioned at −30 μm and focused a distance of 6767 mm.

FIG. 7B illustrates a simulated MTF versus field performance data for the optics (e.g., lens assembly and sensor prism) of a folded optic sensor array embodiment of FIG. 7A.

FIG. 7C illustrates an embodiment of a folded optic sensor array having a lens assembly positioned at 0.0 μm and focused a distance of 1000 mm.

FIG. 7D illustrates a simulated MTF versus field performance data for the optics of a folded optic sensor array embodiment of FIG. 7C.

FIG. 7E illustrates an embodiment of a folded optic sensor array having a lens assembly positioned at 142 μm and focused a distance of 200 mm.

FIG. 7F illustrates a simulated MTF versus field performance data for the optics of a folded optic sensor array embodiment of FIG. 7E.

FIGS. 8A-C illustrate an embodiment of where a folded optic sensor assembly is moved to obtain auto-focus.

FIG. 9 illustrates a ray trace of light as it passes through an embodiment of a lens assembly for a folded optic image sensor.

DETAILED DESCRIPTION

Introduction

Embodiments of the auto-focus systems and techniques for folded optic, multi-camera systems as described herein can include a two-part lens system and an autofocus assembly provided for each image sensor in the multi-camera system. The multi-camera system can include a primary and secondary light folding surface associated with each of a plurality of image sensors. The primary light folding surface can be a reflective mirror or refractive prism, can be mounted to a substrate, and can split incoming light from the target image scene into multiple portions corresponding to the number of image sensors in the multi-camera system. The secondary light folding surface can be a reflective mirror or refractive prism, and can redirect the portion of light from the primary light folding surface toward the image sensor, for example where the image sensor is mounted flat on the same substrate to which the primary light folding surface is mounted.

One important aspect of such an embodiment is that, by redirecting light toward each sensor in the array using one or more reflective surfaces or prisms, it is possible to position all image sensors on a common flat substrate. In turn this enables putting all sensor surfaces on one common die, where circuitry can be shared between all sensors, possibly reducing the die area, power requirements and interfaces in and outside of the die.

As stated above, the traditional method of moving the lens up or down above the image sensor may not be desirable because it may increase the camera module height and potentially create other undesirable characteristics or aspects.

Another potential problem with autofocus in folded optic systems can be the need to achieve high modulation transfer function (MTF) resolution performance across the full field of view (FoV) of the image projected on the image sensor surface. The maximum MTF performance of a lens assembly is bounded by the diffraction limit of the lens assembly, which is determined by the f-number and the wavelengths of light passing through the lens assembly. The lens assembly is made up of one or more optical elements from the first surface of the lens assembly to the last surface of the lens assembly that projects the image onto another surface, such as an image sensor surface. An element can be made of one or more optical surfaces that can, for example, refract light or reflect light.

While the lens assembly actual MTF performance can be less than the diffraction limit across the full range of image heights used, it is desirable to design the lens so that it is possible to fabricate a set of samples or large quantity of lens assemblies that are very close to the diffraction limit of the lens assembly across the full FoV of the lens assembly.

As the MTF performance requirements of a lens assembly increase towards its maximum performance (e.g., the diffraction limit), this may place more demands on the tolerances of the mechanical autofocus mechanisms and/or the optical components of the lens assembly, including other aspects of the camera design. Examples of optical components or other camera design aspects that may require tighter tolerances are the autofocus mechanisms, the lens surfaces, lens materials, the alignment of the lens surfaces with respect to one another, and the combined fabrication of the autofocus mechanisms with the lens assembly as a single operating unit. The mechanical autofocus mechanisms can, for example, create lens tilt errors (e.g. rotational errors about the optical axis) and/or translational errors (e.g. X, Y and Z linear direction errors about the optical axis). In a design intended for fabrication ranging from just a few samples to large quantities it is generally a good practice to establish limits for all key variations, such as lens tilt and lens translation, and then determine a tolerance budget for all components, elements, or aspects that can create these variations, such as lens tilt or translation, and those components, elements or aspects that can be influenced by those variations. The influence normally is expressed in MTF reduction as one or more key variations change in amount. After determining the tolerance budget for all mechanical and optical components, elements or design aspects, those components, elements or aspects can then be designed so they stay within the budgeted tolerances with a certain level of statistical confidence. The use of such practices can, for example, increase the yield of the final finished product, such as the complete single or multi camera autofocus module. By viewing this system in this way, the lens assembly can be designed to be less sensitive to factors caused by the aforementioned variations or to contribute less to the aforementioned variations.

When auto-focus mechanisms or other similar descriptions or references are used herein, such a reference can include all related linkages, components, elements or aspects associated or not associated with the process of focusing a lens. For example autofocus mechanisms can include one or more motors, one or more actuators, linkages, devices, components, elements, or aspects that may cause or pass motion, where this motion in turn will move or cause action to bring a lens system into a certain level of focus. Other factors may affect the lens assembly MTF without the motor or motion from the motor. The level of the focus can be expressed in various ways such as in terms of MTF, Pulse Spread Function (PSF), or by other suitable measures.

Though discussed herein primarily in the context of MTF performance, this is for illustrative purposes, and lens performance can be expressed in other embodiments by similar concepts such as PSF, Line Spread Function (LSF) or other direct or indirect ways of expressing similar concepts.

The embodiments described herein may be used for folded optics, high MTF resolution auto-focus designs where the lens assembly design and autofocus mechanical structure design can work together to reduce the variations that can cause the MTF resolution to decrease, and/or to reduce the MTF sensitivity of the lens assembly, elements, aspects for the types and magnitude of the variations that may occur. The range of possible variations that can lead to loss or reduction in the MTF performance can come for secondary sources, tertiary sources, or the like, that are affected by the aforementioned possible variations, or others variations and in turn influence or reduce the MTF performance.

One example of a process to design a folded optics system is to start with the image sensor pixel pitch. The lens will need to act as an anti-aliasing filter in the optical domain. If image sensor pixel pitch is not taken into account at the beginning of the lens design process, then the resulting lens design may filter out scene frequency content, in cycles per mm at the focus plane, that are below the Nyquist sample rate of the image sensor. In addition, the resulting lens design may allow too much above Nyquist scene frequency content in cycles per mm to pass through, in which case the image may have noticeable aliasing artifacts. As a generally accepted rule, the lens system should reduce the MTF to 20% or slightly less at the Nyquist rate in cycles per mm. The diffraction limit can then be used as starting point for the lens design, where the f-number can be determined that would meet the 20% or slightly less rule. Once the f-number is determined then an amount to increase the diffraction limit can be estimated so that the final lens assembly design will have 20% MTF or less at the Nyquist rate. For example, if the lens final MTF is 80% less than the diffraction limit near the Nyquist frequency, in cycles per mm, then the f-number potentially could be increased to help achieve the 20% or slightly less rule.

The more the diffraction limit is increased, the wider the clear aperture will need to be increased, provided the effective focal length remains approximately constant. As the clear aperture increases the height of the lens assembly may increase. In order to keep the folded optics as thin as possible it is accordingly important to design the lens assembly so the MTF performance is as close as possible to the diffraction limit. Otherwise, it may not be possible to meet the module height, or thinness, requirements for the entire single or multi camera autofocus module. Those skilled in the art will recognize the f-number is equal to the effective focal length divided by the clear aperture of the imaging system, such as a camera lens system or assembly.

For the embodiments presented herein, the lens assemblies were designed to remain as close as possible to the diffraction limit across all scene frequency content rates in cycles per mm up to the Nyquist rate and all the way out to the diffraction limit vanishing point. In addition, the MTF performance was designed to remain as close as possible to the diffraction limit across the full FoV of the lens assembly, and at all focus distances from infinity to a near distance of 200 mm.

The embodiments presented herein, as examples, are based on using an imaging sensor square pixel array where the pixel pitch is 1.1 μm and the pixel fill factor is 100%. The embodiments and examples described below therefore are based on the Nyquist rate of 454 per mm. Those knowledgeable about sample theory will recognize that a square aperture width, such as 1.1 μm, may introduce sampling MTF roll-off. This sampling MTF roll-off can be calculated. The diffraction limit can be increased further to compensate for the sampling MTF roll-off so that at the Nyquist rate the lens MTF roll-off plus the sampling MTF roll-off will produce a net 20% MTF altogether; or some other slightly less MTF level as the case may be.

It should also be recognized that the embodiments presented herein are not limited to any pixel size, shape, pitch, rectangular array, non-rectangular array, or arrangement where the pixel size or shape can be different from one another on the surface of the image sensor. The embodiments are intended to point out the factors or aspects that are needed to design such a system and the benefits, attributes and claims of the system being described herein. The embodiments are not limited to the pixel size or other factors covered when describing or referring to those embodiments.

The embodiments presented herein can be implemented using a refractive sensor prism or a reflective mirror over the sensor. The refractive sensor prism can use total internal reflection properties to reflect light towards the sensor surface or a reflective surface on the refractive prism shaped optical element.

For the embodiments presented herein the sensor prism reflective surface and also the sensor mirror surface can have the most sensitivity to rotation and translational variations. These variations can come from the operation of the autofocus mechanisms, the motor, and interactions of the motor with other mechanical and/or optical components, elements or aspects as well as other environmental conditions such as motion, temperature, and shock. The rotation and translational variations can come from other related or unrelated sources. Other aspects can also have an impact on the MTF performance.

The embodiments described herein utilize methods intended to reduce the aforementioned variations.

In some examples, the two-part lens system can include a movable portion positioned between the primary and secondary light folding surfaces of the folded optical path of one image sensor. The movable portion of the lens assembly can move laterally (e.g., in a direction parallel to the plane formed by the image sensor) between the primary and secondary light folding surfaces to change the focal depth of an image captured by the sensor. The movable portion may include a number of lenses selected to produce the desired focal length and resolution. The two-part lens system can also include a stationary portion, for example a field corrector lens positioned in close proximity to the sensor. In some embodiments, the field corrector lens may be affixed (e.g., glued or mechanically held in place) to a glass cover plate positioned over the sensor.

In some embodiments, the autofocus assembly used to move the movable portion of the lens system can include an actuator and a guide rail or other guiding device. The actuator may be a voice coil motor (VCM), micro-electronic mechanical system (MEMS), piezoelectric motor, or a shape memory alloy (SMA). The actuator can be coupled to the substrate on the opposite side of the secondary light folding surface from the movable portion of the lens assembly, and can be coupled to the movable portion by the guide rail. The guide rail can translate the actuator motion to the movable portion, and in some embodiments can engage (for example, slidably engage) a surface of the secondary light folding surface in order to constrain tilt (e.g., roll, pitch, yaw, and rotational motion) and lateral translation movements of the movable portion within the tolerances of the lens design.

Various embodiments will be described below in conjunction with the drawings for purposes of illustration. It should be appreciated that many other implementations of the disclosed concepts are possible, and various advantages can be achieved with the disclosed implementations.

Overview of Autofocus Assembly

Referring now to FIGS. 1A and 1B, one example of an embodiment of an autofocus system for a folded optic multi-sensor assembly 100A, 100B will now be described in greater detail. FIG. 1A illustrates a cross-sectional side view of an embodiment of a folded optic sensor assembly with autofocus capabilities 100A showing one sensor assembly and an associated autofocus device. FIG. 1B illustrates a cross-sectional side view of an embodiment of a folded optic sensor assembly with autofocus capabilities 100B showing two sensor assemblies and associated autofocus devices.

As shown in the example of FIG. 1A, an image sensor 125 is positioned on a substrate 150. The substrate 150 is adjacent, at one edge shown in the cross-section, to an optical element configured to re-direct light incoming light, and that includes a primary light folding surface 124. As illustrated, the primary light folding surface 124 is part of a refractive prism 145. As illustrated, sensor 125 is mounted within a rectangular slot 117 formed in the printed circuit board 195. Stud bumps 107 are part of the sensor 125 and are used to make contact with electrically conducting pads on the printed circuit board 195. The printed circuit board 195 is mounted on substrate 150 and remains stationary relative to the substrate 150. This is just one example of how the sensor 125 can be mounted to the substrate 150 and make electrical contact with a printed circuit board like 195. In some embodiments the sensor 125 can be affixed to substrate 150 using an adhesive. In some embodiments, sensor 125 may be formed as part of substrate 150, for example substrate 150 may be a silicon die or other semiconductor material suitable for forming sensor 125 in a portion thereof. As illustrated, the sensor 125 is covered by cover glass 126, and lens L6 is positioned on the other side of the cover glass 126 from the sensor 125. In some examples, the cover glass 126 is coupled to the sensor 125 during manufacturing in order to prevent contamination of a light receiving surface of the sensor. However, in some embodiments the cover glass 126 may be omitted and the lens L6 may be coupled directly to the sensor 125.

The lens L6 can be a field corrector lens in some embodiments and can be a stationary component of the L1-L6 lens system. The secondary light folding surface 135 extends away from the lens L6, and as illustrated is formed as a refractive prism 136A coupled to a support block 136B at the secondary light folding surface 135. It is possible that a mirror surface be placed between the 136A and 136B instead of using the internal reflective characteristics of a prism to redirect the light.

A movable portion 130 of the lens system including lenses L1, L2, L3, L4, and L5 is positioned between the primary light folding surface 124 and the secondary light folding surface 135. Optical axis 123 shows one example of a path that light could take as it enters the array camera 100A, is redirected off of the primary light folding surface 124, passes through the movable portion 130 of the lens system, is redirected off of the secondary light folding surface 135, passes through the lens L6 and the cover glass 126, and is incident upon the sensor 125. The movable portion 130 can move laterally (e.g., along the optical axis 123 that extends from the primary light folding surface 124 and the secondary light folding surface 135 and in a direction substantially parallel to the plane formed by the sensor 125) between a bounding edge 141 of the refractive prism 145 forming the primary light folding surface 124 and a bounding edge 131 of the refractive prism 136A forming the secondary light folding surface 135 in order to change the focus distance in the object space.

In some embodiments, the sensor 125, cover glass 126, lens L6, and the unit including the refractive prism 136A and/or block 136B (referred to herein as an “optical element”) may be adhered or otherwise affixed in the illustrated configuration such that these components are fixed together relative to one another within the camera. In some embodiments these components may be permanently, or semi-permanently fixed together such that their positions with respect to one another stay the same, which stabilizes the optical path of light through the elements. In some embodiments, as discussed above, cover glass 126 may be omitted and the remaining sensor 125, lens L6, and the refractive prism 136A and/or block 136B can be adhered or otherwise affixed to one another with the lens L6 positioned between the sensor 125 and the refractive prism 136A and/or block 136B. As illustrated, the optical element comprises an input surface (bounding edge 131) for receiving the light passed from the primary light folding surface 124 through the movable portion of the lens assembly 130, the secondary light folding surface 135 an output surface (adjacent to the lens L6), and a guide surface 186.

As used herein, the term “camera” refers to an image sensor, lens system, and a number of corresponding light folding surfaces, for example the primary light folding surface 124, movable lens assembly 130, secondary light folding surface 135, stationary lens L6, and sensor 125 as illustrated in FIG. 1A. A folded-optic multi-sensor array can include a plurality of such cameras in various configurations. For example, embodiments of array camera configurations are disclosed in U.S. Application Pub. No. 2014/0111650, filed Mar. 15, 2013 and titled “MULTI-CAMERA SYSTEM USING FOLDED OPTICS,” the disclosure of which is hereby incorporated by reference. Other array camera configurations that would benefit from the autofocus systems and methods described herein are possible.

Actuator 180 can be used to laterally move the movable portion 130. The actuator 180 may be a VCM, MEMS, piezoelectric motor, or SMA. The actuator 180 can be coupled to the movable portion 130 by guide rail 185 extending along a lower edge 186 of the refractive prism 136A and/or block 136B. The guide rail 185 can translate motion from the actuator 180 to the movable portion 130. The guide rail 185 can slidably engage lower edge 186 (or another surface within the camera, for example another surface of the refractive prism 136A and/or block 136B, an adjacent surface of a camera housing, a lower surface of the central refractive prism, a pad or block coupled to the optical element, and the like) in order to constrain tilt, roll, pitch, yaw, and translational linear motions of the movable portion 130 (that is, motion away from or twisting around the optical axis of the movable portion 130) within the tolerances of the lens design (e.g., while still providing an image of a desired quality). Although only one guide rail 185 is illustrated, some examples may include a number of guide rails 185 as needed for constraining the motion of the movable portion 130 of the lens assembly. Friction between the guide rail 185 and the lower edge 186, as well as any friction between the movable lens system 130 and surrounding components, may be reduced by any suitable means, for example ball bearings, lubricating liquids or solids, magnetic fields, or a combination thereof. In some embodiments, magnetic coils wrapped around the movable portion 130 and/or the actuator 180 can further minimize unwanted movement in the tilt, roll, pitch, yaw, and translational linear directions.

Although the guide rail 185 is primarily discussed herein as slidably engaging the lower edge 186 of a prism 136A forming the secondary light folding surface 135, the guide rail 185 may slidably engage other surfaces in other embodiments. For example, an end of the guide rail may extend past the movable portion of the lens system and slidably engage a lower surface of the prism 145 forming the primary light folding surface 124. In some embodiments, the camera may include one or more light folding surfaces as reflective mirrors. In such embodiments, the guide rail may contact an edge of one or more of the mirrors and/or mounting blocks for the mirrors in order to constrain the unwanted motion of the movable portion of the lens assembly.

Although discussed primarily within the context of multi-camera folded optic array systems such as are described herein, the autofocus assembly can be used in any folded optic system with one or more image sensors.

As shown in FIG. 1B, a sensor assembly 100B includes a pair of image sensors 105, 125 each mounted to substrate 150, movable lens assemblies 115, 130 corresponding to image sensors 105, 125, respectively, and stationary lenses L6 positioned over the cover glass 106, 126 of image sensors 105, 125, respectively (that is, the cover glass 106, 126 are positioned between the stationary lenses L6 and the image sensors 105, 125). Each movable lens assembly 115, 130 is coupled to a guide rail 184, 185, which is in turn coupled to an actuator 181, 180. The primary light folding surface 122 of refractive prism 140 directs a portion of light from the target image scene along optical axis 121 through the movable portion 115 of the lens system, is redirected off of the secondary light folding surface 110, passes through the lens L6 and the cover glass 106, and is incident upon the sensor 105. The primary light folding surface 124 of refractive prism 145 directs a portion of light from the target image scene along optical axis 123 through the movable portion 130 of the lens system, is redirected off of the secondary light folding surface 135, passes through the lens L6 and the cover glass 126, and is incident upon the sensor 125.

The image sensors 105, 125 may comprise, in certain embodiments, a charge-coupled device (CCD), complementary metal oxide semiconductor sensor (CMOS), or any other image sensing device that receives light and generates image data in response to the received image. Image sensors 105, 125 may be able to obtain image data of still photographs and may also provide information regarding motion in a captured video stream. Sensors 105 and 125 may be individual sensors or may represent arrays of sensors, such as a 3×1 array. Any suitable array of sensors may be used in the disclosed implementations.

The sensors 105, 125 may be mounted on the substrate 150 as shown in FIG. 1B. In some embodiments, all sensors may be on one plane by being mounted to the flat substrate 150. Substrate 150 may be any suitable substantially flat material. The substrate 150 can include an aperture to allow incoming light to pass through the substrate 150 to the primary light folding surfaces 122, 124. Multiple configurations are possible for mounting a sensor array or arrays, as well as the other camera components illustrated, to the substrate 150.

Primary light folding surfaces 122, 124 may be prism surfaces as illustrated, or may be a mirror or a plurality of mirrors, and may be flat or shaped as needed to properly redirect incoming light to the image sensors 105, 125. In some embodiments the primary light folding surfaces 122, 124 may be formed as a central mirror pyramid or prism. The central mirror pyramid, prism, or other reflective surface may split light representing the target image into multiple portions and direct each portion at a different sensor. For example, a primary light folding surface 122 may send a portion of the light corresponding to a first field of view toward the left sensor 105 while primary light folding surface 124 sends a second portion of the light corresponding to a second field of view toward the right sensor 125. In some embodiments in which the receiving sensors are each an array of a plurality of sensors, the light folding surfaces may be made of multiple reflective surfaces angled relative to one another in order to send a different portion of the target image scene toward each of the sensors. It should be appreciated that together the fields of view of the cameras cover at least the target image, and may be aligned and stitched together after capture to form a final image captured by the synthetic aperture of the array.

The light folding surfaces can be flat or curved in various embodiments. A light folding surface can have a curvature that is part of the optical system, whereby it alters the path of the light in a manner other than that of a flat surface. For example such a curved surface could be part of the overall lens optical design, where without using such a curved surface, the performance of the lens design and/or the focusing capability would not be achieved. The light folding surface can also have other materials or optical elements that alter light in the optical path. The other optical elements can include, but are not limited to, Diffractive Optical Elements (DOE), coatings, polarizing elements, etc.

Each sensor in the array may have a substantially different field of view, and in some embodiments the fields of view may or may not overlap. Certain embodiments of the light folding surfaces may have complicated non-planar surfaces or non-spherical surfaces to increase the degrees of freedom when designing the lens system.

After being reflected off the primary light folding surfaces 122, 124, the light may be passed through movable lens systems 115, 130 provided between the primary light folding surfaces 122, 124 and reflective surfaces 110, 135. The movable lens systems 115, 130 may be used to focus the portion of the target image which is directed toward each sensor. The autofocus assembly for the movable lens systems 115, 130 can include an actuator for moving the lens among a plurality of different lens positions. The actuator may be a voice coil motor (VCM), micro-electronic mechanical system (MEMS), or a shape memory alloy (SMA). The autofocus assembly may further include a lens driver for controlling the actuator. As depicted, sensor 105 may be positioned above light folding surface 110 and sensor 125 may be positioned above light folding surface 135. However, in other embodiments, the sensors may be beneath the light reflected surfaces, and the light reflective surfaces may be configured to reflect light downwards. Other suitable configurations of the light folding surfaces and the sensors are possible in which the light from each lens assembly is redirected toward the sensors.

Each sensor's field of view may be projected into the object space, and each sensor may capture a partial image comprising a portion of the target scene according to that sensor's field of view. In some embodiments, the fields of view for the opposing sensor arrays 105, 125 may overlap by a certain amount. To reduce the overlap and form a single image, a stitching process as described below may be used to combine the images from the two opposing sensors 105, 125. Certain embodiments of the stitching process may employ the overlap for identifying common features in stitching the partial images together. After stitching the overlapping images together, the stitched image may be cropped to a desired aspect ratio, for example 4:3 or 1:1, to form the final image.

As illustrated by FIGS. 1A and 1B, each camera has a total height H. In some embodiments, the total height H can be approximately 4.5 mm or less. In other embodiments, the total height H can be approximately 4.0 mm or less. Accordingly, a height of the movable lens systems 115, 130 also does not exceed the height H. The height H can be also be higher than 4.5 mm.

Overview of Example Image Capture System

FIG. 2 depicts a high-level block diagram of one possible embodiment of a device 200 having a set of components including an image processor 220 linked to a plurality of cameras 215 a-215 n. The image processor 220 is also in communication with a working memory 205, memory 230, and device processor 250, which in turn is in communication with electronic storage module 210 and an electronic display 225. In some embodiments, a single processor may be used instead of two separate processors as illustrated in FIG. 2. Some embodiments may include three or more processors.

Device 200 may be, or may be part of, a cell phone, digital camera, tablet computer, personal digital assistant, or the like. There are many portable computing devices in which a reduced thickness imaging system such as is described herein would provide advantages. Device 200 may also be a stationary computing device or any device in which a thin imaging system would be advantageous. A plurality of applications may be available to the user on device 200. These applications may include traditional photographic and video applications, high dynamic range imaging, panoramic photo and video, or stereoscopic imaging such as 3D images or 3D video.

The image capture device 200 includes the cameras 215 a-215 n for capturing external images. As described above, each camera may include a sensor, lens system, autofocus assembly, and light folding surfaces. The cameras 215 a-215 n may each include a sensor, lens assembly, and a primary and secondary reflective or refractive surface for redirecting a portion of a target image to each sensor, as discussed above with respect to FIG. 1A. In general, N cameras 215 a-215 n may be used, where N≧2. Thus, the target image may be split into N portions in which each sensor of the N sensor assemblies captures one portion of the target image according to that sensor's field of view. However, some embodiments may employ only one image sensor assembly, and it will be understood that cameras 215 a-215 n may comprise any number of image sensor assemblies suitable for an implementation of the folded optic imaging device described herein. The number of cameras may be increased to achieve lower z-heights of the system, as discussed in more detail below with respect to FIG. 4, or to meet the needs of other purposes, such as having overlapping fields of view similar to that of a plenoptics camera, which may enable the ability to adjust the focus of the image after post-processing. Other embodiments may have a field of view overlap configuration suitable for high dynamic range cameras enabling the ability to capture two simultaneous images and then merge them together. The cameras 215 a-215 n may be coupled to the camera processor 220 to transmit captured image to the image processor 220.

The image processor 220 may be configured to perform various processing operations on received image data comprising N portions of the target image in order to output a high quality stitched image, as will be described in more detail below. Processor 220 may be a general purpose processing unit or a processor specially designed for imaging applications. Examples of image processing operations include cropping, scaling (e.g., to a different resolution), image stitching, image format conversion, color interpolation, color processing, image filtering (e.g., spatial image filtering), lens artifact or defect correction, lens light roll-off or reduction of light level caused by vignette, and the like. Processor 220 may, in some embodiments, comprise a plurality of processors. Certain embodiments may have a processor dedicated to each image sensor. Image processor 220 may be one or more dedicated image signal processors (ISPs) or a software implementation of a processor.

As shown, the image processor 220 is connected to a memory 230 and a working memory 205. In the illustrated embodiment, the memory 230 stores capture control module 235, image stitching module 240, operating system 245, and autofocus module 250. These modules include instructions that configure the image processor 220 of device 200 to perform various image processing and device management tasks. Working memory 205 may be used by image processor 220 to store a working set of processor instructions contained in the modules of memory 230. Alternatively, working memory 205 may also be used by image processor 220 to store dynamic data created during the operation of device 200.

As mentioned above, the image processor 220 may be configured by several modules stored in the memory 230. The capture control module 235 may include instructions that control the overall image capture functions of the device 200. For example, capture control module 235 may include instructions that call subroutines to configure the image processor 220 to capture raw image data of a target image scene using the cameras 215 a-215 n. Capture control module 235 may then call the image stitching module 240 to perform a stitching technique on the N partial images captured by the cameras 215 a-215 n and output a stitched and cropped target image to imaging processor 220. Capture control module 235 may also call the image stitching module 240 to perform a stitching operation on raw image data in order to output a preview image of a scene to be captured, and to update the preview image at certain time intervals or when the scene in the raw image data changes.

Image stitching module 240 may comprise instructions that configure the image processor 220 to perform stitching and cropping techniques on captured image data. For example, each of the N cameras 215 a-215 n may capture a partial image comprising a portion of the target image according to each sensor's field of view. The fields of view may share areas of overlap, as described above. In order to output a single target image, image stitching module 240 may configure the image processor 220 to combine the multiple N partial images to produce a high-resolution target image. Target image generation may occur through known image stitching techniques.

For instance, image stitching module 240 may include instructions to compare the areas of overlap along the edges of the N partial images for matching features in order to determine rotation and alignment of the N partial images relative to one another. Due to rotation of partial images and/or the shape of the field of view of each sensor, the combined image may form an irregular shape. Therefore, after aligning and combining the N partial images, the image stitching module 240 may call subroutines which configure image processor 220 to crop the combined image to a desired shape and aspect ratio, for example a 4:3 rectangle or 1:1 square. The cropped image may be sent to the device processor 250 for display on the display 225 or for saving in the electronic storage module 210.

Operating system module 245 configures the image processor 220 to manage the working memory 205 and the processing resources of device 200. For example, operating system module 245 may include device drivers to manage hardware resources such as the cameras 215 a-215 n. Therefore, in some embodiments, instructions contained in the image processing modules discussed above may not interact with these hardware resources directly, but instead interact through standard subroutines or APIs located in operating system component 245. Instructions within operating system 245 may then interact directly with these hardware components. Operating system module 245 may further configure the image processor 220 to share information with device processor 250.

Autofocus module 255 can include instructions that configure the image processor 220 to adjust the focus position of each of cameras 215 a-215 n, for example by controlling the movement and positioning of corresponding autofocus assemblies. Autofocus module 255 can include instructions that configure the image processor 220 to perform focus analyses and automatically determine focus parameters in some embodiments, and can include instructions that configure the image processor 220 to respond to user-input focus commands in some embodiments. In some embodiments, the lens system of each camera in the array can be focused separately. In some embodiments, the lens system of each camera in the array can be focused as a group.

Device processor 250 may be configured to control the display 225 to display the captured image, or a preview of the captured image, to a user. The display 225 may be external to the imaging device 200 or may be part of the imaging device 200. The display 225 may also be configured to provide a view finder displaying a preview image for a use prior to capturing an image, or may be configured to display a captured image stored in memory or recently captured by the user. The display 225 may include a panel display, for example, a LCD screen, LED screen, or other display technologies, and may implement touch sensitive technologies.

Device processor 250 may write data to storage module 210, for example data representing captured images. While storage module 210 is represented graphically as a traditional disk device, those with skill in the art would understand that the storage module 210 may be configured as any storage media device. For example, the storage module 210 may include a disk drive, such as a floppy disk drive, hard disk drive, optical disk drive or magneto-optical disk drive, or a solid state memory such as a FLASH memory, RAM, ROM, and/or EEPROM. The storage module 210 can also include multiple memory units, and any one of the memory units may be configured to be within the image capture device 200, or may be external to the image capture device 200. For example, the storage module 210 may include a ROM memory containing system program instructions stored within the image capture device 200. The storage module 210 may also include memory cards or high speed memories configured to store captured images which may be removable from the camera.

Although FIG. 2 depicts a device having separate components to include a processor, imaging sensor, and memory, one skilled in the art would recognize that these separate components may be combined in a variety of ways to achieve particular design objectives. For example, in an alternative embodiment, the memory components may be combined with processor components to save cost and improve performance.

Additionally, although FIG. 2 illustrates a number of memory components, including memory component 230 comprising several modules and a separate memory 205 comprising a working memory, one with skill in the art would recognize several embodiments utilizing different memory architectures. For example, a design may utilize ROM or static RAM memory for the storage of processor instructions implementing the modules contained in memory 230. The processor instructions may be loaded into RAM to facilitate execution by the image processor 220. For example, working memory 205 may comprise RAM memory, with instructions loaded into working memory 205 before execution by the image processor 220.

Overview of Example Image Capture Process

FIG. 3 illustrates an embodiment of a folded optic image capture process 900. The process 900 begins at block 905, in which a plurality of cameras are provided, each having at least one light folding surface and an autofocus assembly. The cameras can form any of the sensor array configurations discussed herein. The cameras may include, as discussed above, a sensor, lens system, and a reflective surface positioned to redirect light from the lens system onto the sensor.

The process 900 then moves to block 910, in which the optical path of the plurality of cameras causes light comprising a target image of a scene to be redirected off at least one light folding surface toward the corresponding imaging sensors. For example, a portion of the light may be redirected off of each of a plurality of surfaces toward each of the plurality of sensors. This step may further comprise passing the light through a lens system associated with each sensor, and may also include redirecting the light off of a second surface onto the sensor.

The process 900 then transitions to block 915, in which a lens assembly associated with each of the cameras is moved to such a position that that an image is focused on the sensor, that is, is “focused” or “autofocused” to a desired focal position. For example, this can be accomplished using the actuator and guide rail discussed above in some embodiments. In some embodiments, the autofocus module 255 of FIG. 2 can perform the lens focusing.

The process 900 may then move to block 920, in which the sensors capture a plurality of images of the target image scene. For example, each sensor may capture an image of a portion of the scene corresponding to that sensor's field of view. Together, the fields of view of the plurality of sensors cover at least the target image in the object space.

The process 900 then may transition to block 925 in which an image stitching method is performed to generate a single image from the plurality of images. In some embodiments, the image stitching module 240 of FIG. 2 may perform this block. This may include known image stitching techniques. Further, any areas of overlap in the fields of view may generate overlap in the plurality of images, which may be used in aligning the images in the stitching process. For example, block 925 may further include identifying common features in the overlapping area of adjacent images and using the common features to align the images.

Next, the process 900 transitions to block 930 in which the stitched image is cropped to a specified aspect ratio, for example 4:3 or 1:1. Finally, the process ends after storing the cropped image at block 935. For example, the image may be stored in storage component 210 of FIG. 2, or may be stored in working memory 205 of FIG. 2 for display as a preview or review image of the target scene.

Overview of Example Autofocus Assemblies

FIG. 4 shows an array camera assembly 1000A according to an embodiment. Camera assembly 1000A comprises lens surfaces L1-L5 implemented in 1001, sensor die 1002, sensor prism 1003, lens surface L6, and sensor cover glass 1005. Sensor prism 1003 can include a mirror surface between two halves or portions of a glass cube in some embodiments.

FIG. 5 shows an array camera 1000B using multiple camera assemblies installed on a common substrate 1004 (e.g., sensor die) according to an embodiment. The array camera 1000B includes a plurality of individual camera assemblies, similar to the assembly 1000A shown in FIG. 4, each comprising lens surfaces L1-L5 implemented in 1001, sensor die 1002, sensor prism 1003, lens surface L6, and sensor cover glass 1005. For clarity, these components have only been labeled on two of the individual camera assemblies. In this example, four camera assemblies 1000A are utilized. More cameras or fewer cameras (or one camera) can also be used. In this example, substrate 1004 can provide rectangular slots where the four image sensor dies 1002 are placed and connected to electrical conducting traces that are also part of the substrate 1004. In some embodiments the sensor die 1002 may be placed directly on the substrate 1004 and connected to electrical conducting traces without utilizing slots. In other embodiments there are a variety of ways for mounting image sensor dies to a substrate that may connect to electrical conducting traces, those skilled in the art may be familiar with other such methods. Electrical connector 1106 is used to connect the electrical devices on subtracted 1004 to the camera image processing system (not shown in this Figure).

In some embodiments, one or more image sensors arrays may be on a common die such as that shown in FIG. 8 of U.S. Application Pub. No. 2014/0111650, filed Mar. 15, 2013 and titled “MULTI-CAMERA SYSTEM USING FOLDED OPTICS,” incorporated by reference above. This figure shows an example of two image sensor image surfaces on one common die 811. In this example the object prisms 820 and 821 are positioned on the outside as opposed to the center as shown in FIG. 1B. The image sensor prism or mirror 830 and 831 are shown in the center. The lens assembly of one lens is symbolized by the lens drawing 840, and likewise for a lens assembly symbolized by the lens drawing 841. The optical axes 860 and 861 are shown pointing at two separate locations on the die 811. The die can contain multiple image sensor array areas or a common image sensor array area that captures the image within the field of view of both lens assemblies 840 and 841. The concept associated with this figure can be extended to a plurality of cameras. Those skilled in the art should recognized there are other ways of aligning cameras so as to capture a plurality of images in the object space and utilize one die to capture a plurality of images associated with each camera. In some embodiments, more than one die can be used where some may have a plurality of images captured with one die and others with only one image per die.

There are advantages of being able to capture images on one die from a plurality of camera assemblies, such as that shown as 1000A on FIG. 4. Such an arrangement can reduce the collective die area and power as compared to the array camera design, such as that shown of 1000B in FIG. 5A, where one camera image is captured on one die.

Two object prisms 1010 are shown in the example of FIG. 5 where two cameras share one object prism. There are many configurations where, for example, one object prism can be used for one, two, three or more camera assemblies such assembly 1000A. They are called the “object prisms” because they are used to fold the optical axis of each camera assembly to point out into the object space. There are other possible configurations of object prisms and camera assemblies. In an embodiment an object prism 1010 may utilize a reflective surface 1011 on the prism instead of using the total internal reflection properties of the prism. The object prism 1010 may be replaced by a mirror instead of using a prism. The prism, prism reflective surface, or mirror would reflect the optical axis and the associated object space rays towards the entrance pupil of a camera.

FIG. 6 illustrates an embodiment of the projected fields of view of the folded optic sensor array embodiment. Fields of view 600A-600D may share areas of overlap 605 as shown. The overlapping fields of view 600A-600D may be stitched into a complete image of the target image scene and cropped to a final image 610 with an aspect ratio such as 4:3 or 1:1.

FIGS. 7B, 7D, and 7F, illustrate simulated MTF performance of the L1-L6 assembly together with a sensor prism within a range of motion for a movable lens assembly 705 as described herein between approximately 0 and approximately 172 μm. FIG. 7C illustrates an embodiment in which a folded optic design and process are used to focus a camera assembly at 1000 mm in the object space. In the embodiment as shown in FIG. 7C, lens assembly 705, including lens elements L1-to-L5, are moved to the reference position 0.0 micrometers (μm) by the actuator 180, which is the position the camera will be focused at 1000 mm in the object space. In each of FIGS. 7A, 7C, and 7E the positioning of the lens assembly 705 within the space 710 bounded by the edge 141 of the central prism and the edge 131 of the sensor prism is indicated by the positioning of the vertical dotted lines.

FIG. 7A illustrates an embodiment of a folded optic camera having a lens assembly positioned at −30 μm with respect to the reference position 0.0 μm. In this embodiment, as shown in FIG. 7A, the camera 1401 is focused at a hyper-focus distance of 6767 mm. FIG. 7B illustrates a simulated MTF versus field performance data for the optics (e.g., lens assembly and sensor prism) of a folded optic sensor array embodiment of FIG. 7A. As described above, FIG. 7C illustrates an embodiment of a folded optic camera having a lens assembly positioned at 0.0 μm and focused a distance of 1000 mm. FIG. 7D illustrates a simulated MTF versus field performance data for the optics of a folded optic sensor array embodiment of FIG. 7C. FIG. 7E illustrates an embodiment of a folded optic camera having a lens assembly positioned at 142 μm with respect to the reference position 0.0 μm. In this embodiment, as shown in FIG. 7E, the camera 1405 is focused at a distance of 200 mm. FIG. 7F illustrates a simulated MTF versus field performance data for the optics of a folded optic sensor array embodiment of FIG. 7E.

The MTF curves shown in 1402, 1404 and 1406 in FIGS. 7B, 7D, and 7F are examples of MTF performance of the L1-L6 assembly when the camera is focused at the distances provided above for FIGS. 7A, 7C, and 7E, respectively. The graphs show the MTF curves at angular direction, with respect to the optical axis, in the Field of View (FoV) of the camera, shown on the graphs as “Y Field in Degrees. The solid line represents the Sagittal performance and the dashed line represents the Tangential performance.

The sensor of each camera may have its own MTF, based on sampling theory that rolls off with the aperture of the pixels and sampling pitch. Therefore, in some examples, the simulated optics performance of FIGS. 7B, 7D, and 7F may not match the measured MTF performance of the overall array camera.

For the focus positions 6767 mm and 1000 mm the corresponding MTF curves are shown and are approximately equal for both Tangential and Sagittal curves across the entire image height in degrees (i.e. from 0 degree to −16 degrees about the optical axis).

For position +142 μm the sagittal performance remains near 50% MTF across the full image height from zero to −16 degrees. However the tangential MTF performance deviates from the sagittal performance as the image height is increased. This means this embodiment is near the shortest distance where useful images can be captured.

In some of the autofocus assembly embodiments described herein, L6 can be attached to the sensor prism, and then the L6 plus the sensor prism can be mounted or permanently mounted either to the image sensor cover glass or directly to the image sensor. This can prevent the sensor prism from tilting or shifting relative to the sensor while an auto-focus process is taking place or tilting or shifting under the influence of other factors such as gravity, motion or temperature.

Mounting the L6 plus the sensor prism to either the sensor or cover glass can provide benefits to overcome the observation that the MTF performance of the lens assembly design shown in FIGS. 1A and 1B is sensitive to the amount of rotational tilt error and linear translational error that the lens sensor prism plus L6 may have with respect to the ideal optical axis as it intersects the image sensor image plane. To overcome this sensitivity, the embodiments herein can provide a lens assembly and autofocus method that does not require moving the sensor prism plus L6 with respect to the image sensor. The benefits of attaching the sensor prism plus L6 to the image sensor image plane include reducing the MTF sensitivity to rotational tilt and linear translational deviations from the ideal optical axis with respect to the image sensor image plane. Once the alignment between the image sensor prism plus L6 to the image sensor plane is accurately done during the assembly process the remaining tilt and translational errors should mostly occur between the L1 to L5 lens assembly 705 and the sensor prism. Use of the guide rail or other suitable devices as described herein can serve to reduce or restrict tilt or translational errors from the ideal optical axis of the L1 to L5 lens assembly 705 with respect to the fixed unit comprised of the sensor prism, L6 and the image sensor.

FIGS. 8A through 8C illustrate one embodiment of a design 1500 on how the lens assembly L1-to-L5 130 are moved back and forth with respect to the sensor prism 136A, 136B by a motor device 1501. By moving assembly 130 back and forth the focus position in the object space can be changed. FIGS. 8A through 8C illustrate how, in this embodiment, the lens elements are moved back and forth to increase or decrease the distance between lens surfaces L5 to the sensor surface, and thereby increasing or decreasing the focal length.

FIG. 8A illustrates the complete assembly 1500, including the components described above with respect to FIG. 1A.

FIG. 8B illustrates an example of a stationary portion 1502 of a complete camera assembly 1500 including substrate 150, actuator 1501, sensor 125, cover glass 126, lens L6, refractive prism 145 including primary light folding surface 124, and secondary light folding surface 135 between refractive prism 136A and block 136B. The actuator can be secured to a support member (e.g., circuit board 195) that, in turn, is secured to the sensor substrate 150.

FIG. 8C illustrates an example of a movable portion 1503 of the camera 1500 including a guide rail 185, movable portion 130 of the lens system including lens surfaces L1-L5, and an actuator contact member 1506. The movable lens assembly 130 can include a number of lenses shaped and positioned to provide the desired focus length. The particular lens configuration shown for the movable lens assembly 130 is meant to provide an example, and any lens configuration capable of maintaining good image performance while moving within the tolerances of the folded optic system can be used. The guide rail 185 can contact the lower surface of the refractive prism 136A and block 136B to stabilize the rotational movement of the movable lens assembly 130 (in the roll, yaw and pitch directions) to within tolerances as well as the translational movement (in the up and down or left and right direction) of the movable lens assembly to within tolerances.

In this embodiment the method to hold the assemblies, such as 1503 to assembly 1502, are not shown. Examples of such methods include, but are not limited to, using glides and/or interlocking grooves. One or more magnetic fields, such as induced by magnets not requiring a power source and/or magnetic field generators that do/can require power sources, can be used to lower the resistance between mechanical parts and/or assemblies such as 1502 and 1503. For example such glides and/or interlocking groves with, for example, having two magnetic fields. One magnetic field could be around 130 and a second one could be around the motor area 180 or other locations in such as assembly 1500. Whereas the traditional mobile device lens barrel is suspended by normally one magnetic field and thereby bring about more translational X, Y and Z displacement and/or rotational displacement such as roll, pitch and yaw.

Another embodiment of a suitable folded optic system is to use the mirror surface 135 as a secondary light directing surface without a surrounding prism. Accordingly, illustrated element of prism portions 136A, 136B are removed and only the mirror surface 135 remains. A structure design to secure a mirror 135 can be used to guide the guide rail 185.

By holding the tolerances with tighter tolerances than that of a traditional mobile device, influences of forces (e.g., acceleration and deceleration of the camera system) and vibrations from influences within and outside the camera systems can be prevented, abated and/or minimized.

There are many other forms of suspension other than magnetic fields, for example, such methods that could be used include one or more of oil, ball bearings, air, gas, lubricating liquids or solids, or the like.

One advantage of the folded optic multi-sensor assemblies described herein is the ability to use long guides and one or more suspensions such as using, for example, magnetic fields, ball bearings and liquids such as oils to keep devices like, but not necessarily, 1502 and 1503 within tight tolerances. Such tolerances, for example, can be translational movement tolerances like X, Y and Z linear directions and the rotation movement tolerances like roll, pitch and yaw, where the meaning translational movement, rotational movement, pitch movement, roll movement, and yaw movement can be found in literature. The reference directions for these tolerances are not shown because it will depend on the particular design used.

Another advantage is there is room to provide structures that are electrical and/or mechanical between and around the camera assemblies 1000A and/or 1500. One such structure could be interlocking electrical and/or mechanical devices to control the focus positions for 1, 2 or more camera assemblies 1000A and/or 1500. The embodiments of the present application are not limited to mobile camera devices and are equally applicable to camera devices and imaging systems of any type.

A key advantage of folded optics is that position indicators can be used such that an appropriate process can make use of this information. There may be more room for such position indicators within the lens assembly 1000A and/or 1500. There may also be more room within the array camera housing holding one or more cameras. Such position indicators can be placed on the housing and/or assembly substrates like that shown in or on 1004, as shown in FIG. 5.

Whereas the housing is a structures that may surround the assembly camera modules and/or the assembly substrate 1004 either partially or completely.

In other embodiments, and optical designs the location of movement between lens surfaces, L1 to L6 may be different, but the same concepts as described herein apply. The number of surfaces can be different for other optical designs. Other implementations could be used such as changing the curvature of one or more surfaces such as that of a liquid lens or other technologies. Some advantages of such implementations are, for example: the optical axis of one camera relative to the others in the array does not change position, which is an important consideration when stitching images together. It is possible to implement a position indicator of the moveable lens assembly. With this information a module or an external device, like an image sensor processor (ISP), can estimate the distance the camera is focused at. Knowledge of the focus location for each camera in the array can help with how to stitch the images together and enable unique other features like provide extended (depth of field) DoF images by focusing each camera at different distances. Calibration can be used to determine within reasonable certainty whether each of the cameras has obtained good focus.

Another embodiment is remove prism block 136A and keep only the mirror surface 135. The mirror surface can be attached to a plate, a supporting block like 136B or other means. Around the mirror a structure can be placed to keep the mirror firmly aligned and stationary with respect to image plane surface of an image sensor 125, where the mirror, L6 and the image sensor 125 will not move relative to each other. The structure used to firmly hold the sensor 125, L6 and the mirror surface 135 in place can also be designed to support the movable system 1503 shown in FIG. 8C. Whereas all items described where 136A and 136B are in the embodiment now also apply to this case were they are not in the embodiment.

Another embodiment is to use a “U” bracket instead of a rod like 185 shown in FIGS. 8A and 8C. This “U” bracket can glide over all three surfaces of a sensor prism 136A and 136B or the mirror support structure as described above. This will add additional support to minimize or restrict tilt and linear translational variations or movement

Overview of Example Ray Trace

FIG. 9 shows a ray trace of light as it passes through an embodiment of the lens assembly 1600, traveling through lens surfaces L1 to L5, reflecting off of surface 1603, passing through lens surface L6 and on to the sensor surface 1602. In this embodiment, five fans of light rays are shown using different dashing for purposes of clarity in following the fans through the lens assembly 1600. Each fan is from a different point relative to the optical axis and far enough away to be consider being at infinity. As these rays travel through the optical surfaces L1-L6 they progressively cluster together as they move closer to the sensor surface 1602.

The embodiment of the lens assembly 1600 illustrated in FIG. 9 does not have structures 136A and 136B as shown in FIGS. 8A and 8B, instead showing only the mirror surface 1603 (135 in FIGS. 8A and 8B). The support structure holding the mirror surface 1603 is not shown and the support structure for the moveable structure 1503, as shown in FIG. 8C, is also not shown in FIG. 9. In FIG. 9 the rays are shown entering L1 from 5 different object heights in the camera's FoV of the object space and travel through the optical lens system L1 to L6 and end at the sensor image plane surface in 5 different image heights.

A prism or mirror 1603 is used to reflect the rays toward the image sensor 1602. Assuming that the lens L6 directly above the image sensor is not present, it becomes apparent the light rays must past a long distance from the last lens in the horizontal lens assembly (where horizontal refers to a plane parallel to the plane of the sensor surface 1602) to the mirror or prism 1603 and then arrive at the surface of the sensor 1602. Accordingly, lens surface L6, sometimes called a “Field Corrector”, is placed close to the image plane to make final corrections to the rays so they converge as close as possible to a point on the sensor surface 1602. Such a lens is placed close to the image plane, where part of its function is to make adjustments of the ray so they are better focused across the full image height. Lens L6 as demonstrated advantages due to its ability to afford minor corrections to the progression of light through the system which will enable the ability to image high resolution images on the image sensor surface, whereas a system without lens surface such as L6 may not meet the above mentioned performance and variation tolerance requirements.

Terminology

Implementations disclosed herein provide systems, methods and apparatus for auto-focus in a multi-sensor folded optic system. One skilled in the art will recognize that these embodiments may be implemented in hardware, software, firmware, or any combination thereof.

In some embodiments, the circuits, processes, and systems discussed above may be utilized in a wireless communication device. The wireless communication device may be a kind of electronic device used to wirelessly communicate with other electronic devices. Examples of wireless communication devices include cellular telephones, smart phones, Personal Digital Assistants (PDAs), e-readers, gaming systems, music players, netbooks, wireless modems, laptop computers, tablet devices, etc.

The wireless communication device may include one or more image sensors, two or more image signal processors, a memory including instructions or modules for carrying out the CNR process discussed above. The device may also have data, a processor loading instructions and/or data from memory, one or more communication interfaces, one or more input devices, one or more output devices such as a display device and a power source/interface. The wireless communication device may additionally include a transmitter and a receiver. The transmitter and receiver may be jointly referred to as a transceiver. The transceiver may be coupled to one or more antennas for transmitting and/or receiving wireless signals.

The wireless communication device may wirelessly connect to another electronic device (e.g., base station). A wireless communication device may alternatively be referred to as a mobile device, a mobile station, a subscriber station, a user equipment (UE), a remote station, an access terminal, a mobile terminal, a terminal, a user terminal, a subscriber unit, etc. Examples of wireless communication devices include laptop or desktop computers, cellular phones, smart phones, wireless modems, e-readers, tablet devices, gaming systems, etc. Wireless communication devices may operate in accordance with one or more industry standards such as the 3rd Generation Partnership Project (3GPP). Thus, the general term “wireless communication device” may include wireless communication devices described with varying nomenclatures according to industry standards (e.g., access terminal, user equipment (UE), remote terminal, etc.).

The functions described herein may be stored as one or more instructions on a processor-readable or computer-readable medium. The term “computer-readable medium” refers to any available medium that can be accessed by a computer or processor. By way of example, and not limitation, such a medium may comprise RAM, ROM, EEPROM, flash memory, CD-ROM or other optical disk storage, magnetic disk storage or other magnetic storage devices, or any other medium that can be used to store desired program code in the form of instructions or data structures and that can be accessed by a computer. Disk and disc, as used herein, includes compact disc (CD), laser disc, optical disc, digital versatile disc (DVD), floppy disk and Blu-ray® disc where disks usually reproduce data magnetically, while discs reproduce data optically with lasers. It should be noted that a computer-readable medium may be tangible and non-transitory. The term “computer-program product” refers to a computing device or processor in combination with code or instructions (e.g., a “program”) that may be executed, processed or computed by the computing device or processor. As used herein, the term “code” may refer to software, instructions, code or data that is/are executable by a computing device or processor.

Software or instructions may also be transmitted over a transmission medium. For example, if the software is transmitted from a website, server, or other remote source using a coaxial cable, fiber optic cable, twisted pair, digital subscriber line (DSL), or wireless technologies such as infrared, radio, and microwave, then the coaxial cable, fiber optic cable, twisted pair, DSL, or wireless technologies such as infrared, radio, and microwave are included in the definition of transmission medium.

The methods disclosed herein comprise one or more steps or actions for achieving the described method. The method steps and/or actions may be interchanged with one another without departing from the scope of the claims. In other words, unless a specific order of steps or actions is required for proper operation of the method that is being described, the order and/or use of specific steps and/or actions may be modified without departing from the scope of the claims.

It should be noted that the terms “couple,” “coupling,” “coupled” or other variations of the word couple as used herein may indicate either an indirect connection or a direct connection. For example, if a first component is “coupled” to a second component, the first component may be either indirectly connected to the second component or directly connected to the second component. As used herein, the term “plurality” denotes two or more. For example, a plurality of components indicates two or more components.

The term “determining” encompasses a wide variety of actions and, therefore, “determining” can include calculating, computing, processing, deriving, investigating, looking up (e.g., looking up in a table, a database or another data structure), ascertaining and the like. Also, “determining” can include receiving (e.g., receiving information), accessing (e.g., accessing data in a memory) and the like. Also, “determining” can include resolving, selecting, choosing, establishing and the like.

The phrase “based on” does not mean “based only on,” unless expressly specified otherwise. In other words, the phrase “based on” describes both “based only on” and “based at least on.”

In the foregoing description, specific details are given to provide a thorough understanding of the examples. However, it will be understood by one of ordinary skill in the art that the examples may be practiced without these specific details. For example, electrical components/devices may be shown in block diagrams in order not to obscure the examples in unnecessary detail. In other instances, such components, other structures and techniques may be shown in detail to further explain the examples.

Headings are included herein for reference and to aid in locating various sections. These headings are not intended to limit the scope of the concepts described with respect thereto. Such concepts may have applicability throughout the entire specification.

It is also noted that the examples may be described as a process, which is depicted as a flowchart, a flow diagram, a finite state diagram, a structure diagram, or a block diagram. Although a flowchart may describe the operations as a sequential process, many of the operations can be performed in parallel, or concurrently, and the process can be repeated. In addition, the order of the operations may be re-arranged. A process is terminated when its operations are completed. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. When a process corresponds to a software function, its termination corresponds to a return of the function to the calling function or the main function.

The previous description of the disclosed implementations is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these implementations will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other implementations without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the implementations shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein. 

What is claimed is:
 1. An image capture system comprising: a plurality of cameras configured to capture a corresponding plurality of portions of a target image scene, a camera of the plurality of cameras comprising: an image sensor; a primary light folding surface configured to redirect light representing one portion of the corresponding plurality of portions of the target image scene in a first direction toward the image sensor; an optical element having an input surface configured to receive the light from the primary light folding surface, a secondary light folding surface configured to redirect the light in a second direction toward the image sensor, and an output surface through which light redirected by the secondary light folding surface propagates in the second direction toward the image sensor; a lens assembly comprising: a stationary portion having a first surface connected to the output surface of the optical element and a second surface connected to the image sensor, wherein the stationary portion of the lens assembly comprises a field corrector lens, and a movable portion positioned between the primary light folding surface and the optical element; an actuator configured to move the movable portion of the lens assembly along the first direction; at least one guide rail coupled to the actuator and the movable portion of the lens assembly, the at least one guide rail positioned to slidably engage another surface within the camera to constrain motion of the movable portion of the lens assembly away from an optical axis or rotating around the optical axis, the optical axis substantially parallel to the first direction; and a processor configured to generate a final image of the target image scene based at least partly on the corresponding plurality of portions.
 2. The system of claim 1, further comprising: a cover glass coupled between the image sensor and the stationary portion of the lens assembly, wherein the cover glass comprises a first cover glass surface coupled to the second surface of the stationary portion of the lens assembly and a second cover glass surface coupled to the image sensor; wherein the first direction, output surface of the optical element, first surface of the stationary portion of the lens assembly, first cover glass surface, second cover glass surface are generally parallel to a plane formed by the image sensor; wherein the optical element further comprises a guide surface extending generally parallel to the plane formed by the image sensor, the guide surface comprising the another surface within the camera which the at least one guide rail is positioned to slidably engage; wherein the input surface of the optical element is generally perpendicular to the plane formed by the image sensor; and wherein the secondary light folding surface extends at an angle between the guide surface of the optical element and the output surface of the optical element.
 3. The system of claim 2, wherein the optical element, stationary portion of the lens assembly, cover glass, and image sensor are adhered into a stack.
 4. The system of claim 1, further comprising a substrate, wherein the image sensor of each of the plurality of cameras is inserted into a slot in the substrate.
 5. The system of claim 1, further comprising one or more of glides, interlocking grooves, a magnetic field, oil, ball bearings, air, gas, or water positioned in contact with one or both of the movable portion of the lens assembly and the at least one guide rail in order to reduce friction.
 6. The system of claim 1, wherein the actuator is configured to move the movable portion of the lens assembly in the first direction within a space formed between an output surface of a prism comprising the primary light folding surface and the input surface of the optical element comprising the secondary light folding surface.
 7. The system of claim 6, wherein motion of the movable portion of the lens assembly in the first direction within the space between a near focus position and a far focus position is approximately 180 micrometers or less.
 8. The system of claim 6, wherein the near focus position corresponds to a focal distance of approximately 200 mm and the far focus position corresponds to a focal length of approximately 6767 mm.
 9. The system of claim 1, wherein the movable portion of the lens assembly comprises a plurality of lenses affixed to one another.
 10. The system of claim 1, wherein the primary light folding surface comprises one of a mirror and a refractive prism.
 11. The system of claim 1, wherein the optical element comprises one of a mirror and a refractive prism.
 12. The system of claim 1, further comprising an autofocus module storing instructions for focusing the movable portion of the lens assembly.
 13. The system of claim 1, wherein the optical element is positioned between the movable portion of the lens assembly and the actuator.
 14. The system of claim 13, wherein the at least one guide rail is coupled to a lower surface of the actuator at a first end and is coupled to a lower surface of the movable portion of the lens assembly at a second end.
 15. The system of claim 1, wherein a pixel pitch of a pixel of the image sensor is 1.1 μm.
 16. An image capture system comprising: a plurality of cameras configured to capture a corresponding plurality of portions of a target image scene, a camera of the plurality of cameras comprising: an image sensor; a primary light folding surface configured to redirect light representing one portion of the corresponding plurality of portions of the target image scene in a first direction toward the image sensor; an optical element having an input surface configured to receive the light from the primary light folding surface, a secondary light folding surface configured to redirect the light in a second direction toward the image sensor, and an output surface through which light redirected by the secondary light folding surface propagates in the second direction toward the image sensor; a lens assembly comprising: a stationary portion having a first surface coupled to the output surface of the optical element and a second surface coupled to the image sensor, and a movable portion positioned between the primary light folding surface and the optical element; an actuator configured to move the movable portion of the lens assembly along the first direction; at least one guide rail coupled to the actuator and the movable portion of the lens assembly, the at least one guide rail positioned to slidably engage another surface within the camera to constrain motion of the movable portion of the lens assembly away from an optical axis or rotating around the optical axis, the optical axis substantially parallel to the first direction; and a processor configured to generate a final image of the target image scene based at least partly on the corresponding plurality of portions, wherein the actuator is configured to move the movable portion of the lens assembly in the first direction within a space formed between an output surface of a prism comprising the primary light folding surface and the input surface of the optical element comprising the secondary light folding surface, and wherein the optical element comprises a refractive prism and further comprises a support block affixed to the refractive prism at the secondary light folding surface, a lower prism surface of the prism and a lower block surface of the support block comprising the another surface within the camera which the at least one guide rail is positioned to slidably engage.
 17. An autofocus device for a folded optic imaging system having an image sensor and a primary light folding surface, the autofocus device comprising: an optical element having an input surface configured to receive light from a primary light folding surface, a secondary light folding surface configured to redirect the received light in a second direction toward an image sensor, and an output surface through which light redirected by the secondary light folding surface propagates in the second direction toward the image sensor; a lens assembly comprising: a stationary portion having a first surface connected to the output surface of the optical element and a second surface connected to the image sensor, wherein the stationary portion of the lens assembly comprises a field corrector lens, and a movable portion positioned in an optical path between the image sensor and the primary light folding surface; a motor configured to move the movable portion of the lens assembly along an optical axis in a direction substantially parallel to a plane formed by the image sensor; and at least one guide rail coupled to the motor and the movable portion of the lens assembly, the at least one guide rail positioned to slidably engage another surface within the folded optic imaging system to constrain motion, of the movable portion of the lens assembly, away from the optical axis or from rotating around the optical axis.
 18. The autofocus device of claim 17, wherein the guide rail slidably engaging the another surface constrains motion of the movable portion of the lens assembly in the tilt, roll, pitch, and yaw rotational directions and in the linear X, Y, and Z directions.
 19. The autofocus device of claim 17, wherein the motor comprises one of a voice coil motor, piezoelectric stepper motor, micro electrical mechanical system, or a shape memory alloy.
 20. The autofocus device of claim 17, further comprising a cover glass affixed over a light receiving surface of the sensor, wherein the field corrector lens is coupled to the cover glass.
 21. The autofocus device of claim 17, wherein a pixel pitch of a pixel of the image sensor is 1.1 μm.
 22. An image capture apparatus comprising: means for capturing a plurality of portions of a target image scene comprising a means for sensing; and an optical element having an input surface configured to receive light from a primary light folding surface and an output surface through which light redirected by the optical element propagates towards the means for sensing; the means for capturing further comprising a lens assembly comprising a stationary portion having a first surface connected to the output surface of the optical element and a second surface connected to the means for sensing, wherein the stationary portion of the lens assembly comprises a field corrector lens, and a movable portion positioned between the means for sensing and the primary light folding surface; means for moving the movable portion of the lens assembly; and means for controlling motion of the movable portion of the lens assembly by slidably engaging at least one surface within the image capture apparatus.
 23. The apparatus of claim 22, wherein the means for capturing a plurality of portions of a target image scene comprises a plurality of folded-optic cameras configured in an array.
 24. The apparatus of claim 22, wherein a pixel pitch of a pixel of the means for sensing is 1.1 μm. 